Figure 1. 158P1D7 SSH sequence (SEQ ID NO: 655)



1 GATCTGATAA GCTTTCAATG TTGCGCTCCT GACAATGTAT TAGAAGTCCT GATGGGGATA 
61 GGACTTTGCA GTTACAAGGA ATAGGGCAGA AAGGTCCTGG AAGTTGAGTG GATGGCTTTG 
121 TAATATAAGG TATCAAACCT GGTGCTTTGG TGGGTAGTTT TAGAATGGAC GTGGTCTTAG 
181 TTGACATGCG ACTATCATTT ATTGAAGATG TTGCTGCCAG ATGTAATGAT C 
Figure 2. 158P1D7 cDNA clone TurboScript3PX and open reading frame (ORF)



1 MKLWIHLFYSSLL 
1 tcggatttcatcacatgacaacATGAAGCTGTGGATTCATCTCTTTTATTCATCTCTCCT 
14 ACISLHSQTPVLSSRGSCDS
61 TGCCTGTATATCTTTACACTCCCAAACTCCAGTGCTCTCATCCAGAGGCTCTTGTGATTC
34 LCNCEEKDGTMLINCEAKGI

121 TCTTTGCAATTGTGAGGAAAAAGATGGCACAATGCTAATAAATTGTGAAGCAAAAGGTAT 
54 KMVSEISVPPSRPFQLSLLN 

181 CAAGATGGTATCTGAAATAAGTGTGCCACCATCACGACCTTTCCAACTAAGCTTATTAAA 
74 NGLTMLHTNDFSGLTNAISI

241 TAACGGCTTGACGATGCTTCACACAAATGACTTTTCTGGGCTTACCAATGCTATTTCAAT 
94 HLGFNNIADIEIGAFNGLGL

301 ACACCTTGGATTTAACAATATTGCAGATATTGAGATAGGTGCATTTAATGGCCTTGGCCT

114 LKQLHINHNSLEILKEDTFH

361 CCTGAAACAACTTCATATCAATCACAATTCTTTAGAAATTCTTAAAGAGGATACTTTCCA
134 GLENLEFLQADNNFITVIEP
421 TGGACTGGAAAACCTGGAATTCCTGCAAGCAGATAACAATTTTATCACAGTGATTGAACC
154 SAFSKLNRLKVLILNDNAIE 

481 AAGTGCCTTTAGCAAGCTCAACAGACTCAAAGTGTTAATTTTAAATGACAATGCTATTGA
174 SLPPNIFRFVPLTHLDLRGN
541 GAGTCTTCCTCCAAACATCTTCCGATTTGTTCCTTTAACCCATCTAGATCTTCGTGGAAA 
194 QLQTLPYVGFLEHIGRILDL 
601 TCAATTACAAACATTGCCTTATGTTGGTTTTCTCGAACACATTGGCCGAATATTGGATCT 
214 QLEDNKWACNCDLLQLKTWL 
661 TCAGTTGGAGGACAACAAATGGGCCTGCAATTGTGACTTATTGCAGTTAAAAACTTGGTT 
234 ENMPPQSIIGDVVCNSPPFF 
721 GGAGAACATGCCTCCACAGTCTATAATTGGTGATGTTGTCTGCAACAGCCCTCCATTTTT 
254 KGSILSRLKKESICPTPPVY 
781 TAAAGGAAGTATACTCAGTAGACTAAAGAAGGAATCTATTTGCCCTACTCCACCAGTGTA 
274 EEHEDPSGSLHLAATSSIND
841 TGAAGAACATGAGGATCCTTCAGGATCATTACATCTGGCAGCAACATCTTCAATAAATGA
294 SRMSTKTTSILKLPTKAPGL
901 TAGTCGCATGTCAACTAAGACCACGTCCATTCTAAAACTACCCACCAAAGCACCAGGTTT 
314 IPYITKPSTQLPGPYCPIPC 
961 GATACCTTATATTACAAAGCCATCCACTCAACTTCCAGGACCTTACTGCCCTATTCCTTG 
334 NCKVLSPSGLLIHCQERNIE 

1021 TAACTGCAAAGTCCTATCCCCATCAGGACTTCTAATACATTGTCAGGAGCGCAACATTGA 

354 SLSDLRPPPQNPRKLILAGN 
1081 AAGCTTATCAGATCTGAGACCTCCTCCGCAAAATCCTAGAAAGCTCATTCTAGCGGGAAA 

374 IIHSLMKSDLVEYFTLEMLH 
1141 TATTATTCACAGTTTAATGAAGTCTGATCTAGTGGAATATTTCACTTTGGAAATGCTTCA 

394 LGNNRIEVLEEGSFMNLTRL 
1201 CTTGGGAAACAATCGTATTGAAGTTCTTGAAGAAGGATCGTTTATGAACCTAACGAGATT 

414 QKLYLNGNHLTKLSKGMFLG
1261 ACAAAAACTCTATCTAAATGGTAACCACCTGACCAAATTAAGTAAAGGCATGTTCCTTGG 

434 LHNLEYLYLEYNAIKEILPG 
1321 TCTCCATAATCTTGAATACTTATATCTTGAATACAATGCCATTAAGGAAATACTGCCAGG 

454 TFNPMPKLKVLYLNNNLLQV 
1381 AACCTTTAATCCAATGCCTAAACTTAAAGTCCTGTATTTAAATAACAACCTCCTCCAAGT 

474 LPPHIFSGVPLTKVNLKTNQ 
1441 TTTACCACCACATATTTTTTCAGGGGTTCCTCTAACTAAGGTAAATCTTAAAACAAACCA

494 FTHLPVSNILDDLDLLTQID
1501 GTTTACCCATCTACCTGTAAGTAATATTTTGGATGATCTTGATTTACTAACCCAGATTGA

514 LEDNPWDCSCDLVGLQQWIQ 
1561 CCTTGAGGATAACCCCTGGGACTGCTCCTGTGACC...

534 KLSKNTVTDDILCTSPGHLD
1621 AAAGTTAAGCAAGAACACAGTGACAGATGACATCCTCTGCACTTCCCCCGGGCATCTCGA 

554 KKELKALNSEILCPGLVNNP 
1681 CAAAAAGGAATTGAAAGCCCTAAATAGTGAAATTCTCTGTCCAGGTTTAGTAAATAACCC 

574 SMPTQTSYLMVTTPATTTNT 
1741 ATCCATGCCAACACAGACTAGTTACCTTATGGTCACCACTCCTGCAACAACAACAAATAC 

594 ADTILRSLTDAVPLSVLILG 
1801 GGCTGATACTATTTTACGATCTCTTACGGACGCTGTGCCACTGTCTGTTCTAATATTGGG 

614 LLIMFITIVFCAAGIVVLVL
1861 ACTTCTGATTATGTTCATCACTATTGTTTTCTGTGCTGCAGGGATAGTGGTTCTTGTTCT 

634 HRRRRYKKKQVDEQMRDNSP 
1921 TCACCGCAGGAGAAGATACAAAAAGAAACAAGTAGATGAGCAAATGAGAGACAACAGTCC 

654 VHLQYSMYGHKTTHHTTERP 
1981 TGTGCATCTTCAGTACAGCATGTATGGCCATAAAACCACTCATCACACTACTGAAAGACC 

674 SASLYEQHMVSPMVHVYRSP 
2041 CTCTGCCTCACTCTATGAACAGCACATGGTGAGCCCCATGGTTCATGTCTATAGAAGTCC 

694 SFGPKHLEEEEERNEKEGSD 
2101 ATCCTTTGGTCCAAAGCATCTGGAAGAGGAAGAAGAGAGGAATGAGAAAGAAGGAAGTGA 

714 AKHLQRSLLEQENHSPLTGS 
2161 TGCAAAACATCTCCAAAGAAGTCTTTTGGAACAGGAAAATCATTCACCACTCACAGGGTC 

734 NMKYKTTNQSTEFLSFQDAS 
2221 AAATATGAAATACAAAACCACGAACCAATCAACAGAATTTTTATCCTTCCAAGATGCCAG 

754 SLYRNILEKERELQQLGITE 
2281 CTCATTGTACAGAAACATTTTAGAAAAAGAAAGGGAACTTCAGCAACTGGGAATCACAGA

774 YLRKNIAQLQPDMEAHYPGA 
2341 ATACCTAAGGAAAAACATTGCTCAGCTCCAGCCTGATATGGAGGCACATTATCCTGGAGC 

794 HEELKLMETLMYSRPRKVLV 
2401 CCACGAAGAGCTGAAGTTAATGGAAACATTAATGTACTCACGTCCAAGGAAGGTATTAGT

814 EQTKNEYFELKANLHAEPDY
2461 GGAACAGACAAAAAATGAGTATTTTGAACTTAAAGCTAATTTACATGCTGAACCTGACTA 

834 LEVLEQQT* (SEQ ID NO:657) 
2521 TTTAGAAGTCCTGGAGCAGCAAACATAGatggaga (SEQ ID NO: 656) 
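
The pairing of nucleotide rows and amino acid rows in Figure 2 can be spot-checked with a short script. The following is a minimal sketch, assuming the standard genetic code and assuming that the open reading frame begins at the first uppercase base of the row at position 1 (lowercase bases being untranslated sequence). The names `CODON_TABLE`, `translate`, `row_1`, and `orf_start` are illustrative and do not come from the source.

```python
# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First nucleotide row of Figure 2.
row_1 = "tcggatttcatcacatgacaacATGAAGCTGTGGATTCATCTCTTTTATTCATCTCTCCT"
# Assumption: the ORF starts at the first uppercase base of the row.
orf_start = next(i for i, base in enumerate(row_1) if base.isupper())
n_terminus = translate(row_1[orf_start:])
print(n_terminus)
```

For the first row this gives MKLWIHLFYSSL, the first twelve residues of the amino acid row above it. The thirteenth residue spans the row boundary, so it only appears when consecutive rows are joined before translation.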
Figure 3. 158P1D7 amino acid sequence.



1 MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 
61 VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN
121 HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 
181 RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 
241 IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT
301 TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 
361 PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 
421 NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 
481 GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV
541 TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 
601 LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM
661 YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 
721 LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA
781 QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ
841 T* (SEQ ID NO:657) 
Figure 4. 158P1D7 amino acid BLAST homology to hypothetical protein [...]



Identities = 798/798 (100%) 

Query: 44 MLINCEAKGIKMVSEISVPPSRPFQLSLLNNGLTMLHTNDFSGLTNAISIHLGFNNIADI 103

MLINCEAKGIKMVSEISVPPSRPFQLSLLNNGLTMLHTNDFSGLTNAISIHLGFNNIADI 
Sbjct: 1 MLINCEAKGIKMVSEISVPPSRPFQLSLLNNGLTMLHTNDFSGLTNAISIHLGFNNIADI 60 

Query: 104 EIGAFNGLGLLKQLHINHNSLEILKEDTFHGLENLEFLQADNNFITVIEPSAFSKLNRLK 163

EIGAFNGLGLLKQLHINHNSLEILKEDTFHGLENLEFLQADNNFITVIEPSAFSKLNRLK 
Sbjct: 61 EIGAFNGLGLLKQLHINHNSLEILKEDTFHGLENLEFLQADNNFITVIEPSAFSKLNRLK 120 

Query: 164 VLILNDNAIESLPPNIFRFVPLTHLDLRGNQLQTLPYVGFLEHIGRILDLQLEDNKWACN 223 

VLILNDNAIESLPPNIFRFVPLTHLDLRGNQLQTLPYVGFLEHIGRILDLQLEDNKWACN 
Sbjct: 121 VLILNDNAIESLPPNIFRFVPLTHLDLRGNQLQTLPYVGFLEHIGRILDLQLEDNKWACN 180 

Query: 224 CDLLQLKTWLENMPPQSIIGDVVCNSPPFFKGSILSRLKKESICPTPPVYEEHEDPSGSL 283

CDLLQLKTWLENMPPQSIIGDVVCNSPPFFKGSILSRLKKESICPTPPVYEEHEDPSGSL
Sbjct: 181 CDLLQLKTWLENMPPQSIIGDVVCNSPPFFKGSILSRLKKESICPTPPVYEEHEDPSGSL 240

Query: 284 HLAATSSINDSRMSTKTTSILKLPTKAPGLIPYITKPSTQLPGPYCPIPCNCKVLSPSGL 343

HLAATSSINDSRMSTKTTSILKLPTKAPGLIPYITKPSTQLPGPYCPIPCNCKVLSPSGL
Sbjct: 241 HLAATSSINDSRMSTKTTSILKLPTKAPGLIPYITKPSTQLPGPYCPIPCNCKVLSPSGL 300

Query: 344 LIHCQERNIESLSDLRPPPQNPRKLILAGNIIHSLMKSDLVEYFTLEMLHLGNNRIEVLE 403

LIHCQERNIESLSDLRPPPQNPRKLILAGNIIHSLMKSDLVEYFTLEMLHLGNNRIEVLE
Sbjct: 301 LIHCQERNIESLSDLRPPPQNPRKLILAGNIIHSLMKSDLVEYFTLEMLHLGNNRIEVLE 360

Query: 404 EGSFMNLTRLQKLYLNGNHLTKLSKGMFLGLHNLEYLYLEYNAIKEILPGTFNPMPKLKV 463

EGSFMNLTRLQKLYLNGNHLTKLSKGMFLGLHNLEYLYLEYNAIKEILPGTFNPMPKLKV
Sbjct: 361 EGSFMNLTRLQKLYLNGNHLTKLSKGMFLGLHNLEYLYLEYNAIKEILPGTFNPMPKLKV 420

Query: 464 LYLNNNLLQVLPPHIFSGVPLTKVNLKTNQFTHLPVSNILDDLDLLTQIDLEDNPWDCSC 523

LYLNNNLLQVLPPHIFSGVPLTKVNLKTNQFTHLPVSNILDDLDLLTQIDLEDNPWDCSC
Sbjct: 421 LYLNNNLLQVLPPHIFSGVPLTKVNLKTNQFTHLPVSNILDDLDLLTQIDLEDNPWDCSC 480

Query: 524 DLVGLQQWIQKLSKNTVTDDILCTSPGHLDKKELKALNSEILCPGLVNNPSMPTQTSYLM 583

DLVGLQQWIQKLSKNTVTDDILCTSPGHLDKKELKALNSEILCPGLVNNPSMPTQTSYLM
Sbjct: 481 DLVGLQQWIQKLSKNTVTDDILCTSPGHLDKKELKALNSEILCPGLVNNPSMPTQTSYLM 540

Query: 584 VTTPATTTNTADTILRSLTDAVPLSVLILGLLIMFITIVFCAAGIVVLVLHRRRRYKKKQ 643

VTTPATTTNTADTILRSLTDAVPLSVLILGLLIMFITIVFCAAGIVVLVLHRRRRYKKKQ
Sbjct: 541 VTTPATTTNTADTILRSLTDAVPLSVLILGLLIMFITIVFCAAGIVVLVLHRRRRYKKKQ 600

Query: 644 VDEQMRDNSPVHLQYSMYGHKTTHHTTERPSASLYEQHMVSPMVHVYRSPSFGPKHLEEE 703 

VDEQMRDNSPVHLQYSMYGHKTTHHTTERPSASLYEQHMVSPMVHVYRSPSFGPKHLEEE 
Sbjct: 601 VDEQMRDNSPVHLQYSMYGHKTTHHTTERPSASLYEQHMVSPMVHVYRSPSFGPKHLEEE 660 

Query: 704 EERNEKEGSDAKHLQRSLLEQENHSPLTGSNMKYKTTNQSTEFLSFQDASSLYRNILEKE 763 

EERNEKEGSDAKHLQRSLLEQENHSPLTGSNMKYKTTNQSTEFLSFQDASSLYRNILEKE 
Sbjct: 661 EERNEKEGSDAKHLQRSLLEQENHSPLTGSNMKYKTTNQSTEFLSFQDASSLYRNILEKE 720 

Query: 764 RELQQLGITEYLRKNIAQLQPDMEAHYPGAHEELKLMETLMYSRPRKVLVEQTKNEYFEL 823

RELQQLGITEYLRKNIAQLQPDMEAHYPGAHEELKLMETLMYSRPRKVLVEQTKNEYFEL 
Sbjct: 721 RELQQLGITEYLRKNIAQLQPDMEAHYPGAHEELKLMETLMYSRPRKVLVEQTKNEYFEL 780 



sd-53617 



Query: 824 KANLHAEPDYLEVLEQQT 841 

KANLHAEPDYLEVLEQQT 
Sbjct: 781 KANLHAEPDYLEVLEQQT 798 



Title: NUCLEIC ACID AND CORRESPONDING PROTEIN NAMED 158P1D7 USEFUL IN THE TREATMENT AND DETECTION OF BLADDER AND OTHER CANCERS
First Inventor: Mary FARIS, et al.
Application No.: To be assigned
Docket No.: 51158-20050.00
Sheet 6 of 20



(SEQ ID NO: 658) 




sd-53624 



158P1D7: 338 LSPSGLLIHCQERNIESLSDLRPPPQNPRKLILAGNIIHSLMKSDLVEYFTLEMLHLGNN 397
S GL ++CQE+NI+S+S+L P P N +KL + GN I + SD ++ L++LHLG+N
Sbjct: 368 PSDLGLSVNCQEKNIQSMSELIPKPLNAKKLHVNGNSIKDVDVSDFTDFEGLDLLHLGSN 427

158P1D7: 398 RIEVLEEGSFMNLTRLQKLYLNGNHLTKLSKGMFLGLHXXXXXXXXXXAIKEILPGTFNP 457
+I V++ F NLT L++LYLNGN + +L +F GLH IKEI GTF+
Sbjct: 428 QITVIKGDVFHNLTNLRRLYLNGNQIERLYPEIFSGLHWLQYLYLEYNLIKEISAGTFDS 487

158P1D7: 458 MXXXXXXXXXXXXXXXXXXHIFSGVPLTKVNLKTNQFTHLPVSNIXXXXXXXXXXXXXXN 517
M + IFSG PL ++NL+ N+F +LPVS + N
Sbjct: 488 MPNLQLLYLNNNLLKSLPVYIFSGAPLARM 547

158P1D7: 518 PWDCSCDLVGLQQWIQKLSKNTVTDDILCTSPGHLDKKELKALNSEILCPGLVNNPSMPT 577
PWDC+CDLV L+ W++KLS V ++ C +P ELK+L +EILCP L+N PS P
Sbjct: 548 PWDCTCDLVALKLWVEKLSDGIVVKELKCETPVQFANIELKSLKNEILCPKLLNKPSAP- 606

158P1D7: 578 QTSYLMVXXXXXXXXXXXXILRSLTDAVPLSVLILGLLIMFITIVFCAAGIVVLVLHRRR 637
+ I VPLS+LIL +L++ I VF A ++V VL R +
Sbjct: 607 ---FTSPAPAITFTTPLGPIRSPPGGPVPLSILILSILVVLILTVFVAFCLLVFVLRRNK 663

158P1D7: 638 RYKKKQVDEQMRDNSPVHLQYSMYGHKTTHHTTERPSASLYEQHMVSPMVHVYRSPSFGP 697
+ K D+LQ+HK T + E++++S + G 

Sbjct: 664 KPTVKHEGLGNPDCGSMQLQLRKHDHK TNKKDGLSTEAF I PQT I EQMS KSHTCGL 718 

158P1D7: 698 KHLXXXXXXXXXXGSDAKHLQRSLLEQENHSPLTGSNMKYKTTNQSTEFLSFQDASSLYR 757
K G K + R+ + ++E + + T ++ E +D++ +

Sbjct: 719 KESETGFMFSDPPGQ--KVVMRWADKEKX>LLHOT 776

158P1D7: 758 NILEKERELQQLGITEYLRKNIAQLQPDMEAHYPGAHEELKLMETLMYSRPRKVLVEQTK 817
N LE + +E +G++ + E YP + K + ■*-!,+ K++VEQ K 

Sbjct: 777 NFLESKKEYNSIGVSGF EIRYPEKQPDKKSKKSLIGGNHSKIVVEQRK 824

158P1D7: 818 NEYFELKANLHAEPDYLEVLEQQT 841
+EYFELKA L + PDYL+VLE+QT
Sbjct: 825 SEYFELKAKLQSSPDYLQVLEEQT 848 (SEQ ID NO:660)
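
The identity counts reported for alignments such as those above can be reproduced for a single alignment block by comparing the two rows column by column. The following is a minimal sketch, not the BLAST program's own procedure. It assumes the two rows are already aligned to equal length, and it assumes that gap characters (`-`) and masked positions (`X`) never count as identities. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query, subject):
    """Return (identities, aligned_columns) for two equal-length aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Assumption: gaps ('-') and masked residues ('X') are never identities.
    identities = sum(
        1 for q, s in zip(query, subject)
        if q == s and q not in "-X"
    )
    return identities, len(query)


# Final alignment block: 158P1D7 residues 818-841 against Sbjct residues 825-848.
query = "NEYFELKANLHAEPDYLEVLEQQT"
subject = "SEYFELKAKLQSSPDYLQVLEEQT"
identities, columns = alignment_stats(query, subject)
print(f"Identities = {identities}/{columns} ({100 * identities / columns:.0f}%)")
```

For this block the count is 17 identical positions out of 24 columns, which agrees with the number of letters on the match line between the two rows.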
Figure 11. 158P1D7 Hydrophilicity Profile

(Hopp TP., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828) 
Figure 12. 158P1D7 Hydropathicity Profile

(Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132) 



ProtScale output for user sequence 
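
A hydropathicity profile of this kind is a sliding-window average of per-residue scale values along the sequence. The following is a minimal sketch using the Kyte-Doolittle scale. The window width of 9 is an assumption, since the source does not state the window used for the figure, and the function name `hydropathy_profile` is hypothetical. The two test segments are the amino acid rows at positions 614 and 634 of Figure 2.

```python
# Kyte-Doolittle hydropathy values (Kyte and Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Mean hydropathy of each full window; window=9 is an assumed width."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


segment_614 = "LLIMFITIVFCAAGIVVLVL"  # residues 614-633
segment_634 = "HRRRRYKKKQVDEQMRDNSP"  # residues 634-653
profile_614 = hydropathy_profile(segment_614)
profile_634 = hydropathy_profile(segment_634)
print(max(profile_614), min(profile_614))
print(max(profile_634), min(profile_634))
```

Every window in the first segment averages above zero and every window in the second averages below zero, so the two rows fall on opposite sides of the hydropathy baseline under this scale.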
Figure 13. 158P1D7 % Accessible Residues Profile

(Janin J., 1979. Nature 277:491-492)
Figure 14. 158P1D7 Average Flexibility Profile

(Bhaskaran R., Ponnuswamy P.K., 1988. 
Int. J. Pept. Protein Res. 32:242-255) 
Figure 15. 158P1D7 Beta-turn Profile

(Deleage, G., Roux B. 1987. Protein Engineering 1:289-294) 